NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Reusing Historical Trajectories in Natural Policy Gradient via Importance Sampling: Convergence and Convergence Rate

https://doi.org/10.1287/opre.2024.0854

Lin, Yifan; Wang, Yuhao; Zhou, Enlu (May 2025, Operations Research)

Theoretical Findings Validate Historical Data Reuse for Improved Policy Optimization A new study, “Reusing Historical Trajectories in Natural Policy Gradient via Importance Sampling: Convergence and Convergence Rate” by Yifan Lin, Yuhao Wang, and Enlu Zhou, explores an advanced approach to reinforcement learning. The research focuses on improving policy optimization by reusing historical trajectories through importance sampling in natural policy gradient methods. The authors rigorously analyze the convergence properties of this approach and demonstrate that reusing past data enhances convergence rates while maintaining theoretical guarantees. Their findings have practical implications for applications where data collection is costly or limited, such as robotics and autonomous systems. By integrating these insights into policy optimization frameworks, the study provides a valuable contribution to the field of reinforcement learning.
more » « less
Free, publicly-accessible full text available May 14, 2026
Bayesian Stochastic Gradient Descent for Stochastic Optimization with Streaming Input Data

https://doi.org/10.1137/22M1478951

Liu, Tianyi; Lin, Yifan; Zhou, Enlu (March 2024, SIAM Journal on Optimization)

Full Text Available
Reusing Historical Observations in Natural Policy Gradient

https://doi.org/10.1109/WSC60868.2023.10407512

Lin, Yifan; Zhou, Enlu (February 2024, Zhou, Enlu)

Full Text Available
Bayesian Distributionally Robust Optimization

https://doi.org/10.1137/21M1465548

Shapiro, Alexander; Zhou, Enlu; Lin, Yifan (June 2023, SIAM Journal on Optimization)

Full Text Available
Risk-Aware Model Predictive Control Enabled by Bayesian Learning

https://doi.org/10.23919/ACC53348.2022.9867207

Li, Yingke; Lin, Yifan; Zhou, Enlu; Zhang, Fumin (June 2022, Proceedings of 2022 American Control Conference)

The performance of a model predictive controller depends on the accuracy of the objective and prediction model of the system. Although significant efforts have been dedicated to improving the robustness of model predictive control (MPC), they typically do not take a risk-averse perspective. In this paper, we propose a risk-aware MPC framework, which estimates the underlying parameter distribution using online Bayesian learning and derives a risk-aware control policy by reformulating classical MPC problems as Bayesian Risk Optimization (BRO) problems. The consistency of the Bayesian estimator and the convergence of the control policy are rigorously proved. Furthermore, we investigate the consistency requirement and propose a risk monitoring mechanism to guarantee the satisfaction of the consistency requirement. Simulation results demonstrate the effectiveness of the proposed approach.
more » « less
Full Text Available
A Bayesian Approach to Online Simulation Optimization with Streaming Input Data

https://doi.org/10.1109/WSC52266.2021.9715392

Liu, Tianyi; Lin, Yifan; Zhou, Enlu (December 2021, Winter Simulation Conference)

Full Text Available

Search for: All records